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(54) RHEUMATOID ARTHRITIS GENE AND METHOD FOR DIAGNOSING RHEUMATOID 
ARTHRITIS 



(57) As a disease gene for rheumatoid arthritis 
present in human chromosome X and a method of di- 
agnosing rheumatoid arthritis, a disease gene, which is 
a mutant of protooncogene Dbl transcribing an mRNA 
that encodes the cDNA of which the sequence from the 
2679th to 2952nd bases is shown in SEQ ID NO: 1, 



which disease gene transcribes an mRNA encoding the 
cDNA of which the region from the 20th to 274th bases, 
in SEQ ID NO: 1 is substituted with thesequence of SEQ 
ID NO: 2, and a method for diagnosing rheuma toid ar- 
thritis by detecting the mRNA of the above-described 
gene or its expression product in a biological specimen, 
is provided. 
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Description 

Technical Field 

5 [0001] The present invention relates to the disease gene of rheumatoid arthritis present in the human X chromosome 
and a method for diagnosing rheumatoid arthritis by detecting the presence of the disease gene or its expression 
product. 

Background Art 

10 

[0002] Although aspects, particularly the pathological process, of arthritis and arthritis mutilans which cause rheu- 
matoid arthritis, have been clarified through various investigations, because most autoimmune diseases associated 
with rheumatoid arthritis developed or worsen into the disease only when various causative factors coincide the in- 
teraction itself of multiple factors must be clarified to understand the disease and to develop appropriate methods of 
15 ' treatment. ' 

[0003] The number of patients with rheumatoid arthritis in the world is 1% or less (N. Engl. J. Med. 322* 1 277-1289 
1 990), but among sibilings of patients, over 8% develop the disease (Cell. 85: 31 1-318,1 996), which leads to the notion 
that some genetic factor may be involved. However, molecular genetic procedures and genetic engineering processes 
used conventionally to discover the genetic factor of diseases may not be effective for autoimmune diseases Such 
problem is caused by the fact that autoimmune diseases do not develop through mechanisms as simple as those of 
cancer, wherein abnormal growth of one mutated gene occurs. Further, although classical genetic procedures which 
search for genetic basis of a disease revealed that autoimmune diseases are caused by multiple genetic factors it has 
not been successful in discovering its entrails or its body. Thus, almost nothing about the entity, or even the locus of 
genes associated with rheumatoid arthritis has been known. 

[0004] By performing linkage analysis using microsatellite markers on rheumatoid arthritis patients arid their relatives 
the present inventors identified three loci of rheumatoid arthritis genes (International Immunology 10(12)- ■1-891-1 89s' 
1998; Journal of Clinical Rheumatology 4 (3) : 156-158, 1998) and filed a patent application for the following disease . 
genes (PCT/JP98/01665). . : 4 r_ . . y 

30 (1) a disease gene of rheumatoid arthritis located within ±1 centi Morgan vicinity of a DNA sequence on human 

chromosome 1 to which microsatellite marker(s) D1S214 and/or D1S253 hybridize(s). 

(2) A disease gene of rheumatoid arthritis located within ±1 centi Morgan vicinity of a DNA sequence on human 
chromosome 8. to which microsatellite marker D8S556 hybridizes. 

(3) A disease gene of rheumatoid arthritis located within ±1 centi Morgan vicinity of a DNA sequence on human 
35 chromosome X to which microsatellite marker(s) DXS1001, DXS1047, DXS 1205, DXS 1227 and/or DXS1232 hy- 

bridize(s). 

[0005] The present inventors identified, as a result of further studies on each of the rheumatoid arthritis genes spec- 
ified in the above-described previous application, the specific gene regarding the disease gene (3) described above 
40 and determined its molecular structure. 

Disclosure of Invention 

[0006] In order to solve the above-describedproblems, the present invention provides a disease gene for rheumatoid 
45 arthntts, which is a mutant of protooncogene Dbl transcribing an mRNA that encodes the cDNA of which the sequence 
from the 2679th to 2952nd bases is shown in SEQ ID NO: 1 , which disease gene transcribes an mRNA encoding the 
of wh,cn toe region from the 20th to 274th bases in SEQ ID NO: 1 is substituted with the sequence of SEQ ID 

NO: 2. 

[0007] The present invention also provides a cDNA of the above disease gene, a DNA fragment, which is a part of 
such cDNA, a protein expressed by the above disease gene, a peptide which is a part of such protein, and an antibody 
against such protein. 

[0008] Further, the present invention provides a method for diagnosing rheumatoid arthritis comprising the detection 
of the mRNA from the above disease gene or the above protein in a biological specimen. 
[0009] The present invention further provides a method for the functionally complementing Dbl deficiency. 

Best Modes for Carrying Out the Invention 

[0010] Hereinafter, embodiments of the present invention having the above-described characteristics will be de- 
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scribed. 

[001 1] The rheumatoid arthritis disease gene of the present invention (hereinafter referred to as "RA disease gene") 
is a variant sequence of known protooncogene Dbl gene (EMBO J. 7(8): 2463-2473, 1988; GenBank Accession No. 
X 12556) which is isolated from human chromosome X by the method described in the after-mentioned Examples. In 

5 other words, this Dbl gene transcribes the mRNA encoding the cDNA for which the sequence of the 2679th to 2952rid 
bases is represented in SEQ ID NO: 1 , while in the cDNA of the variant gene, the sequence of the 3' side of the 241st 
base in SEQ ID NO: 1 is linked to the downstream side of the 1 8th base to induce a frame shift in amino acid translation, 
causing the 19th to 274th base in SEQ ID NO: 1 to be substituted by the sequence shown in SEQ ID NO: 2. Fig. 1 
• shows the base sequence of the 2679th to 2952nd bases (same as SEQ ID NO: 1 ) of Dbl gene cDNA in a normal, the 

10 corresponding base sequence of RA disease gene, and the respective amino acid sequences (1 letter notation) en 
coded by these sequences. 

[0012] In addition, generally, polymorphism of individual differences is often found for human genes. Thus, the RA. 
disease gene of the present invention may include genes that code cDNAs obtained by the addition, deletion or sub- 
stitution of one or more nucleotide in SEQ ID NO: 2. Likewise, the present invention also includes proteins with one 

'5 or more amino acid added to, deleted from and/or substituted, produced by such change to the base. 

[001 3J The cDNAs of the present invention may easily be isolated by, for example, the method described in the after- 
mentioned Example. Further, the cDNAs of the.present invention may be cloned from a cDNA library produced by a 
known method (Mol. Cell. Biol. 2:161-170, 1982; J. Gene 25: 263-269, 1983; Gene 150: 243-250, 1994) using poly(A) 
+RNA extracted from cells of a patient with rheumatoid arthritis. Such cloning may be performed by, for example, 

20 synthesizing oligonucleotides based on the sequence information provided by the present invention and screening by 
colony or plaque hybridization by a known method using the resultant oligonucleotides as probes. Also, oligonucle- 
otides, which hybridize to both ends.of the target cDNA fragment, may be synthesized, and using them as primers, 
the cDNA of the present invention may be produced by RT- PCR method from mRNAs isolated from cells of a patient 
with rheumatoid arthritis. 

25 [001 4] The DNA fragment of the present invention comprises a portion of the aforesaid cDNA, and contains the base 
sequence shown in SEQ ID NO: 3. In other words, SEQ ID NO: 3 is the underlined sequence in Figure 1, and is a 
characteristic region, which is not present in normal Dbl gene or its cDNAs. Further, the DNA fragment includes both 
sense and antisense strands. These DNA fragments may be used as probes for genetic diagnosis . 
[0015] The proteins of the present invention are expression products resulting from the RA disease genes of the 

30 present invention, and has the amino acid sequence shown in SEQ ID NO: 2 at its C-terminal. These proteins may be 
obtained by chemical peptide synthesis method based on the ami no acid sequence provided by the present application, 
or by recombinant DNA technique using cDNAs provided by the present application. For example, when recombinant 
DNA technique is used to obtain the proteins, RNA may be prepared by in vitro transcription using a vector containing, 
the cDNA of the present invention; using this RNA as a template, the proteins may be obtained by in vitro translation. 

35 Also, the coding region of the cDNA may be recombined into an appropriate expression vector by any known method, 
and the recombinant vector obtained may be used to transform E. coli., Bacillus subtilis, yeast, animal cells or the like, 
whereby expression of the protein in bulk would be possible using these recombinant cells. 

[0016] When in vitro translation is used to produce the proteins.of the present invention, the coding region of the 
cDNA of the present invention may be recombined into a vector with RNA polymerasepromoter, and introduced into 
to the in vitro translation system containing the RNA polymerase corresponding to the promoter, such as rabbit reticular 
erythrocyte lysate or wheat embryo extracts. T7, T3 and SP6 may be listed as examples of the RNA polymerase 
promoter. Examples of vectors, which contain any of these RNA polymerase promoters are pKA1 , pCDM8, pT3/T7 18, 
pT7/3 19 and pBluescript II. ■ t 

[0017] Furthermore,.when the proteins of the present invention are expressed using microorganisms such as E. coli. t 
« a recombinant expression vector may be prepared by incorporating the coding region of the cDNA of the present 
invention into an expression vector which contains replication origin replicable in microorganism, promoter, ribosome- 
binding site, cDNA cloning site, terminator and the like, which is then used to transform a host cell and incubating the 
transformed ceil. In such cases, by adding initiation and termination codons before and after an arbitrary coding region, 
protein fragments, which contain the arbitrary region may be obtained. Alternatively, the protein may be obtained as a 
so fusion protein with another protein. By cleaving the fusion protein using an appropriate protease, the target protein 
may also be isolated.. Examples of the expression vector for E. coli. are pUC system, pBluescript II, pET expression 
system and pGEX expression system. 

[0018] When expressing the protein of the present invention in eucaryotic cells, the coding region of the inventive 
cDNA may be incorporated into an expression vector for eucaryotic cells that contains a promoter, a splicing region, 
55 a poly(A) addition site and the like, which may be introduced into eucaryotic cells. Such expression vectors may be 
pKA1 , pCDMS, pSVK3, pMSG, pSVL, pBK-CMV, pBK-RSV, EBV vector, pRS and pYES2. Generally, mammal culture 
cells such as monkey kidney cell COS7 or Chinese hamster ovarian cell CHO, budding yeast, fission yeast, silkworm 
cells and Xenopus iaevis o-site cells are used as eucaryotic cells, but in the present invention, they are not limited to 
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S!m X Z P J«h' , T ° in T!fr eXpression vector int0 ^aryotic cells, any known method such as e.ectroporation. 
calcium phosphate method, liposome method, and DEAE dextran method may be used 

LnLlL?™ PTOteinS f T Pre T d procar y° tic or eucaryotic cells by the above-described methods, the protein 
of interest may be separated from the culture and purified by using combinations of known separationipurification 
oil ^P'f ar f" treatment with degenerating agents such as urea or surfactant, ultrasonlation. enzyme d" 
gestion salt- or solvent-precipitation, dialysis, centrifugation, ultrafiltration, gel filtration, SDS-PAGE, taoef^ctrtcfoais- 
££!£^ hydrOPh0b,C Ch "9raphy, affinity chromatography, reverse phase 

o^{J^p^ mte> " - Pr6Sent inVenti ° n 3,80 e - nCOmpaSSeS fu6ion P^sol the present protein with 

o°rmIL -"if PreSem inV ! nti ° n iS 3 P6Ptide fra9ment ' WhiCh COn,ains at least P art < 5 a ™™ acW residues 

an aniibody ^ sequence shown in SEQ ID N 9 : 2. Such peptide may be used as an antigen for preparing 

mfm L ™* a t !! tib0dy °' thepreSent invention be obtained as a Polyclonal or monoclonal antibody by any known 
method using the protein itsetf or a partial peptide thereof as antigen. ' 
[0023] The method for diagnosing rheumatoid arthritis of the present invention may be performed, for example by 
^ reSenCe w CharaC,e ^ tip * RNAS transcrlb6d b y RA disease gene in a biological specimen (body fluid 
2 ° b a,n6d U ° m t a . aUbjeCt ! Such m f NA may be detected by, for example. RT-PCR ampliation of the rnRNA con- 
taining the characteristic region (e.g., the underlined region in Figure 1), or by in vitro or in situ hybridization analysis 
using any characteristic sequence region of the mRNA for RA disease gene as a probe - 
S^L^T 1 " 0 ^ 7 thodfo ; dia 9 nosin g Rheumatoid arthritis of the present invention may also be performed 

SLec S It h Pre T °! Pr0 ' ,n(S) 6XpreSSed ,r ° m RA disease 9 ene ih a Wological specimen of a subject. Such 
detection may be performed by. for example, enzyme immunoassay or radioimmunoassay using the antibody of the 

kir rl^mn k h J PrC f enCe °' SUCh 9ene 6XpreSsion or P rotein ™V be d «ected by "sing any diagnosis 
be used • hybnd ff , . 0n analy ^ , kit *** as DNA cnip and the like or immunoassay kit such as ELISA kit may 

we°igh!compounds defe<it °' ™* '* com P iemente ti ftj. for ' example. ;proteiri or low molecular 
Examples ■; 

[0026] Hereinafter, the RA disease gene of the present invention will be described in further detail through the fol- 
lowing examples; however, the present invention l is not limited to these examples: .. «a» inrougn the fol 

<Example 1> Identification of the RA disease gene 

f™«in° r '^Kf "! analySi !, b / affeCted sibl Pair analysis method using microsatellite marker. DMAs were prepared 
Z£2 nT T d T 8 ,ami,y °' ""° * eum *° id arthritis patients and one normal, by the guanfdine- 

thiocyanate method (The Japan Society of Blood Transfusion Report 40(2), 413). Further 11 makers (DXS1047 

3^0 T/4 DXS8 ° 4 ] - ?L S8094 : 0X811 ^ DXS1205 ' DXS1227 ' DXS8106; DX8043. DX8028 i^S5SSS2S 
^^nrT- 8 f^- m ^^ enW ^ With hete ™y9° si ty than about 0.7. from the range of the 

^1 f ,?*r , IT preV,OUS, y disc,osed ^ tno present inventors (International Immunology 10 (12) : 1891-1895' 
Journal of Clinical Rheumatology 4(3): 156-158. 1 998). and fluorescence-labeled primers that could amplify eachioci 

Lmo^ 9,00 ^ ' SOlated by PCR unde r^ efo,lowin 9 oonditions. The reaction solution was pre^ 

Spmol of pnmer, approximately O.Sug of template DNA, 1 .5ug of Buffer II (Perkin Elmer Inc.), 1 .OpJ of 2mM dNTP Mix 

(Perkin Elmer Inc.), 0.12*1 of AmpliTaq Gold enzyme (Perkin Elmer Inc.) and 0.9ui of 25mM MgCU (PerkTn E^ Inc 

and add.ng sterilizedwater to obtain a total volume of 15^1. The reacflon was performed in a therm^ 

of MJ Research Inc. First, one cycle of enzyme activation at 95'C for 1 2 minute, 10 cycles of heat denaturation at 94«C 

IS, ?TT?' T eaM " 9 31 47 ' C f0f ° ne minUte and extension at 72 ° C for 2 Chutes were performed, after 
Z vJ, T 31 denatura,,on at for one minute, primer annealing at 47»C for one minute arid extension 

S~r 7n° r x m ' nU A t S ^^ e P erformed - Each ° f the resultant DNA fragments were analyzed in a DNA sequencer (Perkin 
Elmer Inc.. Type AB1377) by subjecting to electrophoresis with size markers for Genescan (Perkin Elmer Inc.) of the 
manufactures specification, and the DNA analysis Was performed by using the attached softwares. Genescan and 
^ n !^ e , r o«f da,a . obtalned were analyzed on Unix system using Mapmaker Sibs software (Am J Hum Genet. 57. 
439-454. 1 995). which is available to the public, for genetic linkage analysis, and the maximum Lod value was calculated 
oy single point analysis. 

[0028] As a result, me maximum Lod was determined to be 2.03 for DXS984, which is located in the 0.1 centi Morgan 
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vicinity of DXS1232, one of the candidate genetic loci disclosed by the present inventors (International Immunology 
10 (12) : 1891-1895; Journal of Clinical Rheumatology 4 (3) : 156-158, 1998) . showing significant correlation. By 
searching the international data base on the internet (Genemap98, http://www.ncbi. nim. nih. gov/ genemap98/), it was 
found that the physical location of DXS984 was 4259 cR1 0000(F) on the G3 Radiation hybrid map, and thus it was 
5 proved that the protooncogene Dbl was situated nearest to DXS984. 

<Example 2> Analysis of Abnormal Dbl Gene 

[0029] In order to compare the cDNAs between Dbl genes, cDNA was synthesized by reverse transcription using 
10 RT-PCR kit (Perkin Elmer Inc.) from the total RNA obtained from peripheral blood of RA disease patients collected 
using Isogen agent (Nippongene Co. Ltd.), and dissolved in 20uJ of sterilized water. Furthermore, primers (SEQ ID 
NO: 4 and 5) were prepared using the Dbl cDNA sequence (Genbank Accession No. X1 2556) (Amersham Pharmacia); 
and part of the Dbl cDNA sequence was isolated by the PCR method. The composition of the reaction solution for PCR 
was : 10 pmol each of forward primer (SEQ ID NO: 4) and reverse primer (SEQ ID NO: 5), approximately 0.1 u.g of 
is template. DNA, 2.5u.l of LA-PCR buffer (Takara Shuzo Co. Ltd.), 4.0u.l of 2.5mM dNTP Mix, 0.25uJ of LA Taq enzyme 
(Takara Shuzo Co. Ltd.) and 2.5uJ of 25mM MgCI 2 mixed, after which sterilized water was added to obtain a total 
volume of 25pl The reaction was performed in a thermal cycler (PTC-200) of MJ Research by repeating 35 cycles of 
the process of heat denaturation at 94°C for 30 seconds, primer annealing at 52°C for 30 seconds and extension at 
.. 72°C for 2 minutes. The PCR products were subjected to electrophoresis of conventional methods, in TAE buffer 
20 solution using 1 % Agarose L (Nippongene Co. Ltd.) gel and DNA molecular weight markers (200bp ladder) by Prbmega 
Co., to confirm the amplified bands. As a result, it was found that the size of normal DNA was 660bp while the size of 
DNA chain from some patients were distinctly shorter (approximately 440bp). 

[0030] Next, after each respective bands were cut out, the gels were melted at 65° C for 1 0 minutes, and the DNAs 
were purified by conventional phenol extraction methods and ethanol precipitation methods. Then, using 100ng of the 

25 resultant DNA as a template, cycle sequence reaction and purification were performed following the specifications of 
the manufacturer of BigDye terminator cycle sequence kit by Perkin Elmer Inc., and the sequence was determined by 
a Type AB1 377 DNA sequencer of Perkin Elmer Inc. As a result, it was evident that in the above-described abnormally 
short DNA, as shown in Fig. 1 , the 223bp from the number 2697 to number 2919 bases are deleted, making it 437 bp. 
This result indicates that with the amino acid deletion encoded in the genetic information downstream of base niimber 

30 2693, and by inducing frame shifty abnormal polypeptide chain short of 65 amino acids is produced. 

Industrial Applicability 

[0031 ] As described in detail above, the present invention provides a disease gene for rheumatoid arthritis occurring 
35 jn human chromosome X. This invention enables the easy and reliable diagnosis of rheumatoid arthritis. Furthermore, 
this invention is useful for the development of novel treatment and therapeutic agents for rheumatoid arthritis. 
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SEQUENCE LISTING 

<110> Shun ichi SHI OZAWA 

<120> RHEUMATOID ARTHRITIS GENE AND METHOD FOR DIAGNOSING RHEUMATOID ARTHRITIS 
<140> PCT/JPOO/01697 
<14t> 1999-10-23 
<150> JP1 1-116933 
<1 51 > 1999-03-20 
• <160> 5 ' ' ] . ' 

<170) Patentln Ver. 2.0 
<210> 1 

<211> 274 . . : 
' <212> UNA ' 
<213> Hobjo sapiens 
<221> CDS 

<222> (2) . (271) • ..; 

<308> GenBank Accession No. XI 2556 

<400> 1 

t ctt cag cag aat gat gaa aag caa cag gga get ttt ata agt act gag 49 

Leu Gin Gin Asn Asp Glu Lys Gin Gin Gly Ala Phe lie Ser Thr Glu 

1 5 10 15 

gaa act gaa ttg gaa cac ace age act gtg gtg gag gtc tgt gag gca 97 
Glu Thr Glu Leu Glu His Thr Ser Thr Val Val Glu VaV Cys Glu Ala 

20 25 30 

att gcg tea gtt cag gca gaa gca aat aca gtt tgg act gag gca tea 145 
He Ala Ser Val Gin Ala Glu Ala Asn Thr Val Trp Thr Glu Ala Ser 

35 40 45 

caa tct gta gaa ate tct gaa gaa cct gcg gaa tgg tea age aac tat 193 
Gin Ser Val Glu Me Ser Glu Glu Pro Ala Glu Trp Ser Ser Asn Tyr 

50 55 50 

ttc tac ccc act tat gat gaa aat gaa gaa gaa aat agg ccc etc atg 241 
Phe Tyr Pro Thr Tyr Asp Glu Asn Glu Glu Glu Asn Arg Pro Leu Met 
65 70 75 80 

aga cct gtg teg gag atg get etc eta tat tga 274 
Arg Pro Val Ser Glu Met Ala Leu Leu Tyr 
85 90 
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20 



2S 



<210> 2 
<211> 61 
<212> ONA 
<213> Hobo sapiens 
<221> CDS 
<222> (1)., (60) 
<400> 2 

a gac ctg tgt egg aga tgg etc tee tat aft gat gaa get act atg tea 49 
Asp Leu Cys Arg Arg Trp Leu Ser Tyr lie Asp Glu Ala Thr Met Ser 
1 5 10 15 

aat ggc aag tag g, 
Asn Gly Lys 

19 ■ ... 

<210> 3 
<211> 10 
<212> DMA 
<213> Homo sapiens 
<400> 3 

30 atgaagacct 10 

<210> 4 
<211> 19 
<212> ONA 

<213> Artificial sequence 
<220> 

<223> Synthesized oligonucleotide 
<400> 4 

ggctagattc aaaccaatg tg 
<210> 5 
45 <211> 17 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthesized oligonucleotide 
<400> 5 

gctacttgcc atttgac 17 
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Claims 

^rnhtf a °ol e SeqU ! nCe fTOm ,hS 2679,h ,0 2952nd bases is shown in S£ Q 'D NO: 1 , which disease gene 
transcnbes an mflNA encoding the cDNA of which the region from the 20th to 274th bases in SEQ ID NO 1 is 
substituted with the sequence of SEQ ID NO: 2. ■ 

2. The cDNA of the disease gene of claim 1 . 

3 ' s^quTnc'rs'QTD'No!^ " diS6aSe 9606 " 1 " *" ^ « CWm 2 " C °" tains the base 

4 ' c ^nTSh f h" eXp ^ 0 n R r °^pt of th e disease gene of claim 1 ..wherein the amino acid sequence of the 
C-terminal is that shown in SEQ ID NO: 2. 

5 ' show^S ID So: T ° f Pr ° tein ° f C ' aim 4 ' C ° ntainS Partia ' SeqUenCe ° f the an,in ° acid se " uence 

6. An antibody against the protein of claim 4 or the peptide of claim 5. 

7. A method for diagnosing rheumatoid arthritis, comprising the detection of the mRNA from the disease gene of 
claim 1 or the protein of claim 4, in a biological specimen. 

8. A method for functionally complementing Dbi deficiency. 
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Figure 
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